Search CORE

Will systems biology offer new holistic paradigms to life sciences?

Author: A Giuliani
A Krishnan
AC Ahn
Alessandro Giuliani
AS Wilkins
AV Finkelstein
C Guill
CL Branden
Filippo Conti
H Kitano
Joseph P. Zbilut
K Klitzing Von
KA Dunker
M Tsuchyia
Maria Cristina Valerio
NS Holter
PK Dhar
R Benigni
RR Klevecz
S Stern
T Hirotani
Publication venue: Springer Netherlands
Publication date: 01/01/2008
Field of study

A biological system, like any complex system, blends stochastic and deterministic features, displaying properties of both. In a certain sense, this blend is exactly what we perceive as the “essence of complexity” given we tend to consider as non-complex both an ideal gas (fully stochastic and understandable at the statistical level in the thermodynamic limit of a huge number of particles) and a frictionless pendulum (fully deterministic relative to its motion). In this commentary we make the statement that systems biology will have a relevant impact on nowadays biology if (and only if) will be able to capture the essential character of this blend that in our opinion is the generation of globally ordered collective modes supported by locally stochastic atomisms

Microarray gene expression profiling and analysis in renal cell carcinoma

Author: AF Fergany
Alexandru Almasan
AN Young
Andrew A Novick
CV Denis
DJ Lockhart
DT Ross
EA Castilla
GH Gulob
H Moch
H Murer
H Suzuki
HC King
HS Sonmez
I Takemasa
J DeRisi
J Xu
JD Brenton
JM Boer
John Hissong
Joseph A DiDonato
JPT Higgins
KM Yamada
L Belov
Louis S Liou
M Ashburner
M Schena
M Takahashi
M Zhou
Marek Skacel
ME Wall
MKS Yeung
N Obermuller
NS Holter
NS Holter
O Alter
P Russo
PC Walsh
Provash Sadhukhan
R Santer
RA Heller
RJ Amato
S Li
S Ramaswamy
S Suer
Sandy D Der
T Akashi
T Ebert
T Shi
Ting Shi
TR Golub
VE Reuter
Y Kariya
YL Zhao
Zhong-Hui Duan
Publication venue: BioMed Central
Publication date: 01/01/2004
Field of study

BACKGROUND: Renal cell carcinoma (RCC) is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. METHODS: Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to affymatrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. RESULTS: Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR). Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. CONCLUSIONS: This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most notably, genes involved in cell adhesion were dominantly up-regulated whereas genes involved in transport were dominantly down-regulated. This study reveals significant gene expression alterations in key biological pathways and provides potential insights into understanding the molecular mechanism of renal cell carcinogenesis

Boston University Institutional Repository (OpenBU)

Public Library of Science (PLOS)

Optimal In Silico Target Gene Deletion through Nonlinear Programming for Genetic Engineering

Author: B Faryabi
B Kofahl
BJ Bornstein
CA Maguire
Chung-Chien Hong
D Deutscher
Diego Di Bernardo
E Meir
GM Cole
H Alper
HG Dohlman
I Shmulevich
J Goutsias
J Lampinen
J Nakae
K Takahashi
M Song
MB Sticklen
Mingzhou Song
NS Holter
R Storn
TG Dewey
XL Zhan
Y Wang
YS Jin
Publication venue: Public Library of Science
Publication date: 01/02/2010
Field of study

Optimal selection of multiple regulatory genes, known as targets, for deletion to enhance or suppress the activities of downstream genes or metabolites is an important problem in genetic engineering. Such problems become more feasible to address in silico due to the availability of more realistic dynamical system models of gene regulatory and metabolic networks. The goal of the computational problem is to search for a subset of genes to knock out so that the activity of a downstream gene or a metabolite is optimized.Based on discrete dynamical system modeling of gene regulatory networks, an integer programming problem is formulated for the optimal in silico target gene deletion problem. In the first result, the integer programming problem is proved to be NP-hard and equivalent to a nonlinear programming problem. In the second result, a heuristic algorithm, called GKONP, is designed to approximate the optimal solution, involving an approach to prune insignificant terms in the objective function, and the parallel differential evolution algorithm. In the third result, the effectiveness of the GKONP algorithm is demonstrated by applying it to a discrete dynamical system model of the yeast pheromone pathways. The empirical accuracy and time efficiency are assessed in comparison to an optimal, but exhaustive search strategy.Although the in silico target gene deletion problem has enormous potential applications in genetic engineering, one must overcome the computational challenge due to its NP-hardness. The presented solution, which has been demonstrated to approximate the optimal solution in a practical amount of time, is among the few that address the computational challenge. In the experiment on the yeast pheromone pathways, the identified best subset of genes for deletion showed advantage over genes that were selected empirically. Once validated in vivo, the optimal target genes are expected to achieve higher genetic engineering effectiveness than a trial-and-error procedure

Principal components analysis based methodology to identify differentially expressed genes in time-course microarray data

Author: A Conesa
A Reverter
AA Alizadeh
BM Wise
C Cheng
C Koch
DK Slonim
DR McMillan
G Zhu
H Sakoe
I Simon
JD Storey
JE Jackson
K Nasmyth
M Koranda
MB Eisen
MR Fielden
MS Bartlett
ND Trinklein
NJH Small
NS Holter
O Alter
OG Troyanskaya
PT Spellman
R Tabibiazar
Rajagopalan Srinivasan
S Raychaudhuri
SE Calvano
Sudhakar Jonnalagadda
T Park
V Vinciotti
W Pan
Z Bar-Joseph
Z Bar-Joseph
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Time-course microarray experiments are being increasingly used to characterize dynamic biological processes. In these experiments, the goal is to identify genes differentially expressed in time-course data, measured between different biological conditions. These differentially expressed genes can reveal the changes in biological process due to the change in condition which is essential to understand differences in dynamics. Results In this paper, we propose a novel method for finding differentially expressed genes in time-course data and across biological conditions (say <it>C</it>1 and <it>C</it>2). We model the expression at <it>C</it>1 using Principal Component Analysis and represent the expression profile of each gene as a linear combination of the dominant Principal Components (PCs). Then the expression data from <it>C</it>2 is projected on the developed PCA model and scores are extracted. The difference between the scores is evaluated using a hypothesis test to quantify the significance of differential expression. We evaluate the proposed method to understand differences in two case studies (1) the heat shock response of wild-type and HSF1 knockout mice, and (2) cell-cycle between wild-type and Fkh1/Fkh2 knockout Yeast strains. Conclusion In both cases, the proposed method identified biologically significant genes.</p

ScholarBank@NUS

svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification

Author: AI Su
Andrea Edwards
AS Householder
BM Bolstad
D Zhu
Dongxiao Zhu
DW Huang
H Li
J Ihmels
J Ihmels
J Miquel
JD Storey
JJ Loor
JM Stuart
JM Zahn
Kun Zhang
L Kaufman
L Liu
L Mao
L Zhang
MB Eisen
ME Wall
ME Wall
ML Genova
NS Holter
O Alter
O Alter
O Troyanskaya
PJ Rousseeuw
S Keerthikumar
S Ramaswamy
SA McCarroll
SL Rea
T Hastie
T Ozawa
W Zhang
W Zhang
W Zhang
Wei Fan
Wensheng Zhang
X Wang
Y Sun
YH Yang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biological themes. Grouping genes with a similar expression pattern or exhibiting co-expression together is a starting point in understanding and analyzing gene expression data. In recent literature, gene module level analysis is advocated in order to understand biological network design and system behaviors in disease and life processes; however, practical difficulties often lie in the implementation of existing methods. Results Using the singular value decomposition (SVD) technique, we developed a new computational tool, named svdPPCS (SVD-based Pattern Pairing and Chart Splitting), to identify conserved and divergent co-expression modules of two sets of microarray experiments. In the proposed methods, gene modules are identified by splitting the two-way chart coordinated with a pair of left singular vectors factorized from the gene expression matrices of the two biological categories. Importantly, the cutoffs are determined by a data-driven algorithm using the well-defined statistic, SVD-p. The implementation was illustrated on two time series microarray data sets generated from the samples of accessory gland (ACG) and malpighian tubule (MT) tissues of the line W118 of <it>M. drosophila</it>. Two conserved modules and six divergent modules, each of which has a unique characteristic profile across tissue kinds and aging processes, were identified. The number of genes contained in these models ranged from five to a few hundred. Three to over a hundred GO terms were over-represented in individual modules with FDR < 0.1. One divergent module suggested the tissue-specific relationship between the expressions of mitochondrion-related genes and the aging process. This finding, together with others, may be of biological significance. The validity of the proposed SVD-based method was further verified by a simulation study, as well as the comparisons with regression analysis and cubic spline regression analysis plus PAM based clustering. Conclusions svdPPCS is a novel computational tool for the comparative analysis of transcriptional profiling. It especially fits the comparison of time series data of related organisms or different tissues of the same organism under equivalent or similar experimental conditions. The general scheme can be directly extended to the comparisons of multiple data sets. It also can be applied to the integration of data sets from different platforms and of different sources.</p

Xavier University of Louisiana: XULA Digital Commons

Multivariate curve resolution of time course microarray data

Author: A de Juan
A de Juan
BM Kim
Christopher P Allen
DM Rocke
DP Kreil
E Segal
ER Malinowski
H Kim
JH Jiang
L Liu
M Juanita Martinez
M Schuemans
M Van Benthem
Margaret Werner-Washburne
MCK Yang
MN Leger
NS Holter
O Alter
O Alter
O Alter
OS Borgen
P Lu
PD Wentzell
Peter D Wentzell
PJ Gemperline
PT Spellman
R Rajkó
R Tauler
R Tauler
S Bergmann
S Raychaudhuri
S Van Huffel
SI Lee
Sushmita Roy
T Ideker
Tobias K Karakach
W Huber
W Liebermeister
W Windig
WH Lawton
X Cui
Y Chen
Z Bar-Joseph
Publication venue: BioMed Central
Publication date: 01/07/2006
Field of study

BACKGROUND: Modeling of gene expression data from time course experiments often involves the use of linear models such as those obtained from principal component analysis (PCA), independent component analysis (ICA), or other methods. Such methods do not generally yield factors with a clear biological interpretation. Moreover, implicit assumptions about the measurement errors often limit the application of these methods to log-transformed data, destroying linear structure in the untransformed expression data. RESULTS: In this work, a method for the linear decomposition of gene expression data by multivariate curve resolution (MCR) is introduced. The MCR method is based on an alternating least-squares (ALS) algorithm implemented with a weighted least squares approach. The new method, MCR-WALS, extracts a small number of basis functions from untransformed microarray data using only non-negativity constraints. Measurement error information can be incorporated into the modeling process and missing data can be imputed. The utility of the method is demonstrated through its application to yeast cell cycle data. CONCLUSION: Profiles extracted by MCR-WALS exhibit a strong correlation with cell cycle-associated genes, but also suggest new insights into the regulation of those genes. The unique features of the MCR-WALS algorithm are its freedom from assumptions about the underlying linear model other than the non-negativity of gene expression, its ability to analyze non-log-transformed data, and its use of measurement error information to obtain a weighted model and accommodate missing measurements

DISCO-SCA and Properly Applied GSVD as Swinging Methods to Find Common and Distinctive Processes

Author: A Subramanian
A Tanay
Age K. Smilde
AK Smilde
Anna Tramontano
Bart De Moor
C Hennig
CC Paige
CF Van Loan
HAL Kiers
HAL Kiers
HAL Kiers
Henk A. L. Kiers
I Måge
IT Jolliffe
Iven Van Mechelen
J Ihmels
J Westerhuis
JA Hageman
JM Stuart
K Devarajan
K Lemmens
K Van Deun
KA Bernstein
Katrijn Van Deun
Lieven De Lathauwer
Lieven Thorrez
M Schouteden
Mariët J. van der Werf
Martijn Schouteden
ME Timmerman
MJ van der Werf
MW Browne
NS Holter
O Alter
P Howland
P Tamayo
RA van den Berg
S Bergmann
S Friedland
SP Ponnapalli
T Dahl
T Löfstedt
U Lorenzo-Seva
VK Mootha
Z Bai
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

BACKGROUND: In systems biology it is common to obtain for the same set of biological entities information from multiple sources. Examples include expression data for the same set of orthologous genes screened in different organisms and data on the same set of culture samples obtained with different high-throughput techniques. A major challenge is to find the important biological processes underlying the data and to disentangle therein processes common to all data sources and processes distinctive for a specific source. Recently, two promising simultaneous data integration methods have been proposed to attain this goal, namely generalized singular value decomposition (GSVD) and simultaneous component analysis with rotation to common and distinctive components (DISCO-SCA). RESULTS: Both theoretical analyses and applications to biologically relevant data show that: (1) straightforward applications of GSVD yield unsatisfactory results, (2) DISCO-SCA performs well, (3) provided proper pre-processing and algorithmic adaptations, GSVD reaches a performance level similar to that of DISCO-SCA, and (4) DISCO-SCA is directly generalizable to more than two data sources. The biological relevance of DISCO-SCA is illustrated with two applications. First, in a setting of comparative genomics, it is shown that DISCO-SCA recovers a common theme of cell cycle progression and a yeast-specific response to pheromones. The biological annotation was obtained by applying Gene Set Enrichment Analysis in an appropriate way. Second, in an application of DISCO-SCA to metabolomics data for Escherichia coli obtained with two different chemical analysis platforms, it is illustrated that the metabolites involved in some of the biological processes underlying the data are detected by one of the two platforms only; therefore, platforms for microbial metabolomics should be tailored to the biological question. CONCLUSIONS: Both DISCO-SCA and properly applied GSVD are promising integrative methods for finding common and distinctive processes in multisource data. Open source code for both methods is provided

University of Groningen

International Migration, Integration and Social Cohesion online publications

UvA-DARE

FigShare

Public Library of Science (PLOS)

Proceedings - University of Groningen

ARTS repository - University of Groningen